Online Learning and Blackwell Approachability in Quitting Games
نویسندگان
چکیده
We consider the sequential decision problem known as regret minimization, or more precisely its generalization to the vectorial or multi-criteria setup called Blackwell approachability. We assume that Nature, the decision maker, or both, might have some quitting (or terminating) actions so that the stream of payoffs is constant whenever they are chosen. We call those environments “quitting games”. We characterize convex target sets C that are Blackwell approachable, in the sense that the decision maker has a policy ensuring that the expected average vector payoff converges to C at some given horizon known in advance. Moreover, we also compare these results to the cases where the horizon is not known and show that, unlike in standard online learning literature, the necessary or sufficient conditions for the anytime version of this problem are drastically different than those for the fixed horizon.
منابع مشابه
Approachability of convex sets in generalized quitting games
We consider Blackwell approachability, a very powerful and geometric tool ingame theory, used for example to design strategies of the uninformed player inrepeated games with incomplete information. We extend this theory to “generalizedquitting games”, a class of repeated stochastic games in which each player may havequitting actions, such as the Big-Match. We provide three simpl...
متن کاملBlackwell Approachability and No-Regret Learning are Equivalent
We consider the celebrated Blackwell Approachability Theorem for two-player games with vector payoffs. Blackwell himself previously showed that the theorem implies the existence of a “noregret” algorithm for a simple online learning problem. We show that this relationship is in fact much stronger, that Blackwell’s result is equivalent to, in a very strong sense, the problem of regret minimizati...
متن کاملApproachability in Stackelberg Stochastic Games with Vector Costs
The notion of approachability was introduced by Blackwell [1] in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector cost of a given agent towards a given target set, irrespective of the strategies of the other agents. In this paper, motivated by the multi-objective optim...
متن کاملA Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games
The notion of approachability was introduced by Blackwell [1] in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector cost of a given agent towards a given target set, irrespective of the strategies of the other agents. In this paper, motivated by the multi-objective optim...
متن کاملA Learning Scheme for Blackwell’s Approachability in MDPs and Stackelberg Stochastic Games
The notion of approachability was introduced by Blackwell ([8]) in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector-cost of a given player towards a given target set, irrespective of the strategies of the other players. In this paper, motivated from the multi-objective...
متن کامل